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Abstract. A Boolean function is called read-once over a basis B if it 
can be expressed by a formula over B where no variable appears more 
QO ■ than once. A checking test for a read-once function / over B depending 

CNj ' on all its variables is a set of input vectors distinguishing / from all other 

read-once functions of the same variables. We show that every read-once 
function / over B has a checking test containing 0(n l ) vectors, where n is 
the number of relevant variables of / and I is the largest arity of functions 
in B. For some functions, this bound cannot be improved by more than a 
constant factor. The employed technique involves reconstructing / from 
its /-variable projections and provides a stronger form of Kuznetsov's 
classic theorem on read-once representations. 
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CO . Let B be an arbitrary set of Boolean functions. A function / is called read-once 

over B iff it can be expressed by a formula over B where no variable appears 
more than once. 

Let f{xx,...,x n ) be a read-once function over B that depends on all its 
variables. Then a set M of ro-bit vectors is called a checking test for / iff for any 

K^ ' other read-once function g(xi, . . . , x n ) over B there exists a vector a € M such 

$_i . that f(a) ^ g(oi). In other words, M is a checking test for / iff the values of 

/ on vectors from M distinguish / from all other read-once functions g of the 
same variables. Note that all these alternatives g, unlike the target function f, 
may have irrelevant variables. 

Denote by Bi the basis of all Z-variable Boolean functions. The goal of this 
paper is to prove that all n-variable read-once functions over Bi have checking 
tests containing 0(n l ) vectors. More generally, for an arbitrary basis B and a 
read-once function / over B, denote by Ts(f) the smallest possible number of 
vectors in a checking test for /. This value can be regarded as the checking test 
complexity of /. We show that 



T Bl {f( Xl ,...,x n ))<2 l -(^j 



and, therefore, for any finite basis B and any sequence of read-once functions /„ 
of n variables over B, it holds that Tg(/n) = 0(n l ) asn-> oo, where I is the 
largest arity of functions from B. 

This result is based on the previously known relevance hypercube method by 
Voronenko |15) . Our main contribution is the proof that the method is correct 
for all bases Bi for an arbitrary I, i.e., it provides a way to construct checking 
tests of specified length (cardinality) for all read-once functions over these bases. 
Previous results give proofs only for I < 5 [14115118] , 

It should be pointed out that the bound Tb,(/) = 0(n l+l ) can be extracted 
from the related paper on the exact identification problem by Bshouty, Hancock 
and Hellerstein [3]. Our result has the following advantages. Firstly, for some 
functions the bound 0(n l ) cannot be improved by more than a constant factor 
(it matches the known lower bound il(n l ) for n-ary disjunction up to a constant 
factor). Secondly, checking tests constructed by the relevance hypercube method 
have regular structure. In short, we show that every read-once function can be 
unambiguously reconstructed from a set of its Z-variable projections with certain 
properties. This fact may look natural at first sight, but turns out a tricky thing 
to prove after all. 

2 Background and Related Work 

Our result has some interesting consequences related to computational learn- 
ing theory. For instance, it is known that checking tests can be used to imple- 
ment equivalence queries from Angluin's learning model [2]. For the problem of 
identifying an unknown read-once function over an arbitrary finite basis B with 
queries, it turns out that non-standard subcube identity queries can be efficiently 
used [7] . A subcube identity query basically asks whether a specified projection 
of the unknown function / is constant, i.e., whether a given partial assignment 
of constants to input variables unambiguously determines the value of /. 

It follows from our results that for any finite basis B, the problem of learning 
an unknown read-once function over B can be solved by an algorithm making 
0{n l+2 ) membership and subcube identity queries, which is polynomial in n 
(here I is the largest arity of functions in B and a standard membership query is 
simply a request for the value of / on a given input vector). This result builds 
upon an algorithm by Bshouty, Hancock and Hellerstein [4], which is a strong 
generalization of a classic exact identification algorithm by Angluin, Hellerstein 
and Karpinski [3]. 

Closely related to the notion of checking test complexity is the definition of 
teaching dimension introduced by Goldman and Kearns [12) . A teaching sequence 
for a Boolean concept (a Boolean function /) in a known class C is a sequence of 
labeled instances (pairs of the form (a, /(a))) consistent with only one function 
/ from C. The teaching dimension of a class is the smallest number t such that 
all concepts in the class have teaching sequences of length at most t. 

In test theory, which dates back to 1950s [6], the corresponding definition is 
that of the Shannon function for test complexity, which is the largest test com- 



plexity of an n-variable function. In these terms, our Corollary can be restated 
as follows: TB,(n) = 0(n l ), where Ts^n) is the Shannon function for checking 
test complexity of read-once Boolean functions (i.e., the maximum of Tb, (/) 
over all n-variable read-once functions over B{). It must be stressed that in the 
definition of a checking test used in this paper, the target function is required to 
depend on all its variables. (One needs to test all 2™ input vectors to distinguish 
the Boolean constant from all read-once conjunctions of n literals.) 

Another appealing problem is that of obtaining bounds on the value of Tb(/) 
for individual read-once functions /. The bound Tb^xi V ... V x n ) = O(n') 
is obtained in [15) and generalized in [16] . In [8], it is shown that for a wide 
class of bases including Bi, I > 2, there exist pairs of read-once functions /, /' 
such that /' is obtained from / by substituting a constant for a variable and 
Tb(I') > Tb{J)- This result shows that lower bounds on Tb(J) cannot generally 
be obtained by simply finding projections of / that are already known to require 
a large number of vectors in their checking tests. 

For the basis B2, individual bounds on the checking test complexity are 
obtained in [17] . In [19] , it is shown that almost all read-once functions over the 
basis {V,©} have a relatively small checking test complexity of O(nlogn), as 
compared to the maximum of Q(n 2 ) (even if alternatives are arbitrary read-once 
functions over B^ and not necessarily read-once over {V,©}). For the standard 
basis {A, V, -1}, it is known that ra + 1 < Tr AV _,i (n) < 2n + l [10], and individual 
bounds can be deduced from those for the monotone basis {A, V} [5|9] . 



3 Basic Definitions 

A variable Xi of a Boolean function f(x\, . . . , x n ) is called relevant (or essential) 
if there exist two n-bit vectors a and /3 differing only in the ith component such 
that /(a) ^ f(P)- If Xi is relevant to /, then / is said to depend on Xi. 

In this paper, we call a pair of functions f(x±, . . . , x n ) and g(yi, ■ ■ . , y n ) 
similar if for some constants <r, <xi, . . . , <j n £ {0, 1} and for some permutation 7r 
of {1, ... , n} the following equality holds: 

f(Xl,. ..,X n ) =9 (*„.(!)) ■ • ■ J^n))' 

where z T stands for z if r = 1 and for z if r = 0. A Boolean function f(x± , . . . , x n ) , 
n > 3, is called prime if it has no decomposition of the form 

f(xi,...,x n ) = g(h(x„(i), . . . ,x^ k ))> x^ k+ i),...,x^ n )) , 

where 1 < k < n and n is a permutation of {1, . . . , n}. 

The structure of formulae expressing read-once functions can be represented 
by rooted trees. A tree of a read-once function f(xi, . . . , x n ) over B\ has n leaves 
labeled with literals of different variables and one or more internal nodes labeled 
with functions from Bi and symbols o G {A,V,©} of arbitrary arity (possibly 
exceeding I). We assume without loss of generality that such trees also have the 
following properties: 



1) any internal node is labeled either with a prime function or with a symbol 
o e {A,V,©}; 

2) internal nodes labeled with identical symbols o £ {A, V, ©} are not adjacent. 

One can readily see that every read-once function over Bi has at least one tree 
of this form. 

In the sequel, variables are usually identified with corresponding leaves in the 
tree. Denote by lca(j/i, . . . , y m ) the least common ancestor of variables j/i, . . . , y m , 
i.e., the last common node in (simple) paths from the root of the tree to 

yi,---,y m - 

Suppose that T is a tree of a read-once function and v is its internal node. 
By T v we denote the subtree ofT rooted at v, i.e., the rooted tree that has root v 
and contains all descendants of v. Hwi,...,w p are children (direct descendants) 
of v, then subtrees T W1 , . . . ,T W are called subtrees of the node v. If a; is a leaf 
of T contained in T v , then by T£ we denote a (unique) subtree T w . containing 
x. Finally, subtrees of the root node of a tree are called root subtrees. 

4 The Relevance Hypercube Method 

This section is devoted to the review of the relevance hypercube method proposed 
by Voronenko in 15 . This method has been known to be correct for the bases 
Bi if Z<5 (see (151151 ). 

From now on, we will use the term "read-once function" instead of "read- 
once function over Bi" . We use boldface letters to denote vectors (often treated 
as sets) of variables. 

Let / be a read-once function depending on variables x = {x\, . . . ,X n }. 
Suppose that H is a set of 2 l input vectors disagreeing at most in iith, . . . , ijth 
components such that the restriction of / to H (which is an Z-variable Boolean 
function) depends on all its I variables x' = {xi x , . . . ,Xj,}. Then H is called 
a relevance hypercube (or an essentiality hypercube) of dimension I for these 
variables x'. Any relevance hypercube can be identified with a partial assignment 
p of constants to input variables such that the induced projection f p depends on 
all its I variables. Such assignments are called Z -justifying in [4]. 

Remark. For some functions / and some subsets of their variables relevance 
hypercubes do not exist. For instance, one may easily check that the func- 
tion d(x, Uo,U\) — (x A Uo) V (a; A u\) has no relevance hypercubes for the set 
u = {uq, ui}. As indicated below, the absence of relevance hypercubes is a ma- 
jor obstacle to proving the correctness of the relevance hypercube method (see 
also |15|18j ). At the same time, for some functions there exist subsets of vari- 
ables with more than one relevance hypercube. An example is given by the same 
function d and the set u' = {x,uq}. 

Any set M of input vectors is called a relevance hypercube set of dimension 
I for / if it contains a relevance hypercube H for every Z-sized subset of x, for 
which such a hypercube exists. (Recall that x is the set of all variables relevant to 



/.) In other words: consider all /-sized subsets w C x such that / has a relevance 
hypercube for w. A set M is a relevance hypercube set iff M contains at least 
one relevance hypercube for each subset w of this kind. It was conjectured that 
any such set is a checking test for /. 

Suppose that / is a read-once function that depends on n variables x. Con- 
struct a relevance table with (?) rows and two columns by the following rule. 
First, fill the first cells of all rows with different /-sized subsets of x. Then for 
each row, if the first cell contains a subset w C x, put in the second cell any 
relevance hypercube for w (along with the corresponding values of /) if such a 
hypercube exists, or the symbol * otherwise. 

In |15j . it is shown that any (valid) relevance table uniquely determines a 
read-once function in the following sense. Suppose that one knows that a function 
g is read-once and agrees with / on all relevance hypercubes from a relevance 
table E for /. If one also knows that for each *-row in E the function g does not 
have a relevance hypercube, then one can recursively reconstruct the skeleton of 
g (which is a tree T" such that negating some of its nodes' labels yields a correct 
tree representing g). After that, the values of / on the vectors from E allow one 
to prove that g is equal to /. 

It follows that a relevance hypercube set M of dimension / for a read-once 
function / is indeed a checking test for / if E contains no *-rows. If for some 
/-sized subset of variables w no relevance hypercube exists, then a more sophis- 
ticated technique is needed to prove that / can still be reconstructed from its 
values on vectors from M. The approach used in this paper is outlined in the 
following Section [SJ 



5 Assumptions and Notation 

In this section we make preliminary assumptions and introduce some notation. 
All subsequent work, including the proof of our main theorem, is based on the 
material presented here. 

We start with an arbitrary read-once function / over Bi, where / > 3. Let x 
be the set of variables relevant to /. Suppose that M is a relevance hypercube 
set of dimension I for /. Our goal is to prove that M is a checking test for /, 
i. e., for any other read-once function g(x) there exists a vector a E M such that 
f(a) ^g(a). 

Take any read-once function g(x) that agrees with /(x) on all vectors from 
M. Firstly and most importantly, we need to prove that root nodes of these two 
functions' trees are labeled with similar functions. If either of the root nodes 
is labeled with a symbol o e {A, V, ©}, then this can be done with the aid of 
techniques similar to those from [15]. Here we focus on the prime case, i. e, we 
assume that 



/(x) = /°(/ 1 (x 1 ),...,/ 5 (x s )) ) 



where both f° and g° are prime, and x 1 U . . . Ux s and y 1 U . . . Uy r are partitions 
of x = y. (Technically, we must first assume that y C x, but it is easily shown 
that no variable from x can be irrelevant to g\ see, e.g., Proposition [2] in the 
next section.) Note that here s < I and r < I. 

Suppose we have already proved that f° and g° are similar. As our second 
step, we need to show that partitions of input variables into subtrees are identical 
in the representations above. In other words, we need to show that each y k is 
equal to some x\ 

These two steps, especially the first one, constitute the main difficulties in 
proving the correctness of the method. The remaining part is technical and can 
be done with the aid of induction on the depth of the tree representing /. A short 
explanation of how this part is done is given at the end of our main theorem's 
proof in Section \8\ 

In the following sections, we will need the colouring of input variables x 
defined by the following rule. To each variable x G x we assign a (unique) colour 
k G {1, . . . , r} such that x G y k . This definition provides a convenient way of 
relating functions / and g (i.e., their tree structure) to each other. 

6 Some Observations 

In this section we present three facts needed for the sequel. A key observation is 
given by the following proposition. 

Proposition 1. Suppose that g' is a projection of g that depends on variables 
x and y having the same colour k. Also suppose that lca(x,y) = v in a tree T" 
of g' . Then, if v is labeled with a prime function h, it follows that all leaves in 
the subtree (T') v have the same colour k. Otherwise, if v is labeled with a symbol 
o G {A, V, ©}, it follows that all leaves in subtrees {T')% and iT')^ have the same 
colour k. 

Proposition [T] follows from a simple fact that substitutions of constants for vari- 
ables of g can result in removing nodes and subtrees from T", or in replacing 
nodes with trees that represent projections of prime functions. Adjacent nodes 
labeled with identical symbols o g {A, V, ©} are subsequently glued together, but 
least common ancestors of each y* either remain roots of single-coloured sub- 
trees, or "support" subsets of single-coloured subtrees of internal nodes labeled 
with o G {A, V,©}. 

For technical reasons, we will also need the following proposition, which holds 
true for all discrete functions (not necessarily read-once or even Boolean) and 
follows from Theorem B in |11) . 

Proposition 2. Suppose that f is an arbitrary function depending on n vari- 
ables x. Also suppose that there exists a relevance hypercube for some p variables 
u C x. Then for every q such that p < q < n there exists a relevance hypercube 
for some q-sized set of variables w such that u G w C x. 

Last but not least, we will use the following fact (see, e.g., [15]). 



7 

Proposition 3. Suppose that a read- once function f is represented by a tree and 
p variables u are taken from p different subtrees of an internal node v, which is 
labeled with a prime function of arity p or with a symbol o £ {A, V, ©}. Then f 
has at least one relevance hypercube for u and, moreover, restrictions of f to all 
such hypercubes are: 

(a) similar to h{z\, . . . , z p ) if v is labeled with a prime function h; 

(b) similar to Z\ o . . . o z p if v is labeled with a symbol o G {A, V, ©}. 

7 Auxiliary Lemmas 

Suppose that read-once functions / and g satisfy all the assumptions made in 
Section [5] and T is a tree of /. Recall that by x we denote the set of all variables 
relevant to /. We say that a set u C x is stable iff for any set w such that 
u C w C x and any relevance hypercube H for w there exists a relevance 
hypercube H' for u such that H' C H. 

Remark. The definition of a stable set u does not require the existence of rel- 
evance hypercubes for all sets w such that u C w C x. What is says is that if 
there exists such a relevance hypercube H , then there exists a relevance hyper- 
cube for u which is a subcube of H. For w = x, however, the definition requires 
that at least one relevance hypercube for u exists. 

One can readily observe that all singleton subsets of x are stable. Examples of 
sets that are not stable are given by u = {uq,ui} (as witnessed by w = uU{y}) 
for functions f\ — (xAd(y,uo,ui))\/(xA(y\/uo\/ui)) and fi = (xAd(y, uo,tti))V 
(x A (uq V Ui)), where d{y, uq, ui) = (y A u Q ) V(j/A ui). Our main ingredient in 
the proof of the main theorem is given by the following lemma, which allows us 
to establish a link between the tree structure of our functions / and g. 

Lemma 1. For any stable set u of at most I variables of the function f, the 
function g agrees with f on some relevance hypercube for u. 

Proof. We start with the definition of a stable set. First choose w = x and 
conclude that / has a relevance hypercube for u. By Proposition [21 / also has 
a relevance hypercube for some /-sized set of variables w' such that u C w'. It 
follows that M contains some relevance hypercube for w'. Since / and g agree 
on all vectors from M, and u is stable, it also follows that / and g agree on some 
relevance hypercube for u. This concludes the proof. 

More than once we will need subsets of input variables having specific structure. 
We call a set u conservative iff for each internal node v of T labeled with a prime 
function h the number of subtrees of v containing at least one variable from u 
is equal either to 0, or to 1, or to the arity of h. To understand the intuition 
behind this term, consider the restriction of / to any relevance hypercube for 
such a set. In the tree of such a restriction, each node of T labeled with a prime 
function is either preserved or discarded, i.e., no constant substitutions and 
further transformations occur at these nodes. 



Lemma 2. All conservative sets are stable. 

Proof. Let u be a conservative set of variables. We say that an internal node of T 
is a branching node for u iff at least two subtrees of v contain leaves from u. The 
proof is by induction over the number b of branching nodes for u in T . If b = 0, 
then |u| < 1 and there is nothing to prove. Suppose that b > 1 and v — lca(u). 
Then v is a branching node and all other branching nodes are descendants of v. 
Therefore, / can be expressed by a formula 

h (z°, /i(/ ll (z 1 ),...,ft m (z m ))) I 

where h is the function corresponding to the label of v, sets z J and z J are disjoint 
for i ^ j and (by our definition of a conservative set) variables u are contained 
in each z 1 , i > 1, but not in z°. 

Identify a relevance hypercube H for some variables w (here u C w) with 
a partial assignment p of constants to variables x. Split p into po,pi, ■ ■ ■ ,p m 
according to the partitioning given by z°, z 1 , . . . , z m . Subsets of u contained 
in z', j > 1, are conservative for trees representing functions hi(z l ), and the 
number of branching nodes in any such tree is at most 6—1. By the inductive 
assumption, there exist partial assignments p[ , . . . , p' m which are extensions of 
Pi,...,Pm an d restrict relevance hypercubes for these subsets. Projections h' t 
induced by p\ depend on these subsets of u. If we now choose an extension p' of 
Po taking ho(z°,u) to a literal of u, the composition oip' ,p[, . . . ,p' m will restrict 
the needed relevance hypercube H' . This concludes the proof. 

In Section [5] we defined the colouring of variables induced by the read-once 
representation of g. The following lemmas reveal some properties of this colouring 
that are related to the structure of T. These properties reflect the observation 
formulated in Proposition [TJ 

Lemma 3. Suppose that variables x and y both have colour k. Also suppose that 
the node v = lca(x, y) in T is labeled with a prime function h. Then all the leaves 
in the subtree T v have the same colour k. 

Proof. Let m be the arity of h. Take arbitrary variables Zi,...,z m —2 such 
that x, y, Zi, . . . , z m -2 are contained in m different subtrees of v. The set u = 
{x, y, Zi, . . . , Zm-2} is conservative and, therefore, stable (by Lemma|2}. Since h 
is prime and / is read-once over Bi, we see that m < Lit follows from Lemma Q] 
that / agrees with g on some relevance hypercube for u. By Proposition [3l the 
restriction of / to any such hypercube is similar to h. Therefore, some projec- 
tion g 1 of g is represented by a tree T" with exactly one internal node, which is 
labeled with a prime function. Variables x and y are relevant to g' and have the 
same colour k. It then follows from Proposition Q] that all variables z\, ■ ■ ■ , z m -2 
have colour k too. Since zi, ■ ■ ■ , Zm-2 were chosen arbitrarily from their subtrees, 
we obtain that all leaves in T^ 1 , . . . ,Ty m ~ 2 have colour k. Repeating the same 
reasoning for initial pairs x, z\ and y, z\ in place of x, y reveals that all leaves in 
T% and T£ also have the same colour k. This concludes the proof. 
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Lemma 4. For each colour k G {l,...,r} ; there exists a unique index i G 
{1, . . . , s} such that y k C x*, i.e., all variables coloured with k belong to the 
set x l . 

Proof. Take any two variables x and y having colour k. If they do not belong 
to the same x l , then they belong to different root subtrees of T. Therefore, the 
node \c&(x,y) is labeled with a prime function f°. By LemmaEJ all leaves of T 
have the same colour, which is a contradiction. 

Another way to state Lemma [4] is to say that the partition y 1 U . . . U y r is a 
refinement of x 1 U . . . U x s . 

Lemma 5. For any non-root internal node v inT labeled with a prime function 
h of arity r or greater, all leaves of T v have the same colour. 

Proof. Let v be an internal node of T labeled with a prime function h of arity 
at least r. If not all leaves of T v have the same colour, then by Lemma [3] any 
two leaves from different subtrees of v have different colours. It then follows that 
leaves of T v are coloured with at least r colours. By Lemma IH leaves of other 
root subtrees of T cannot be coloured, since all colours are taken from the set 
{1, . . . ,r}. This contradiction concludes the proof. 

Lemma 6. Suppose that variables x and y both have colour k. Also suppose that 
in T the node v = lc&(x,y) is labeled with a symbol o g {A,V,©}. Then all the 
leaves in T® and T% have the same colour k. 

Proof. Define the depth of a subtree as the maximum number of edges on (short- 
est) paths from its root to its leaves. Let d be the depth of T v . The proof is by 
induction over d. For d = 1, there is nothing to prove. Suppose that d > 2 
and T^ contains a leaf z having colour k' ^ k. We claim that for some m > 
there exist variables zq, z%, ... , z m such that the set u = {x, y, zq, z%, . . . , z m } is 
conservative, has cardinality at most I, and the restriction of / to any relevance 
hypercube for u can be obtained by negating the inputs and/or the output of 
some function xo f'(y, Zq, Z\, . . . , z m ), where the colours of y and Zg are different 
and /' is either a prime function or a binary function (m — 0) from {A, V, ©} 
different from o. 

First suppose that the root w of T$ is labeled with a prime function h. 
Since not all leaves of T^f have the same colour, it follows from Lemma |3] that 
colours of leaves taken from different subtrees of T% are different. Take arbitrary 
variables zo, Zi, . . . , z m , m > 1, from all subtrees except X^ (one variable from 
each subtree). Now y and zq have different colours and rn+2 < r— 1 by Lemma[5j 
One can easily see that the set u constructed in this way is conservative by 
definition and has cardinality at most Z, because r < I. Proposition [3] then 
reveals that the restriction of / to any relevance hypercube for u indeed has the 
needed form. 

Now consider the case when w is labeled with a symbol • e {A,V,©}. By 
definition of a tree representing a read-once function, * is different from o. Ob- 
serve that the depth of the subtree T w is less than or equal to d — 1, so we can 
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use the inductive assumption for T w . If y and z belong to the same subtree T" 
of w, then it follows that only leaves from T" can have the same colour as y. In 
this case, any leaf from any other subtree can be chosen to be zq. In the other 
case, if y and z belong to different subtrees, simply put zq — z. One can now see 
that m = and u = {x, y, zq} satisfy all the stated conditions. 

Now apply Lemma Q] to the set u (recall that all conservative sets are stable 
by Lemma [2]). It follows that g agrees with / on some relevance hypercube for 
u. By our choice of u, this means that g has a projection g' of the form specified 
above. In the tree of g', the root is adjacent to the leaf labeled with a literal of 
x and to the other internal node, whose children are y, zq, zi, . . . , z m . Since x 
and y have the same colour, it follows from Proposition [T] that zq has the same 
colour as x, which contradicts our choice of Zq. This completes the proof. 

8 Main Theorem 

Theorem. For any read-once function f over Bi, I > 3, depending on all its 
variables, any relevance hypercube set of dimension I for f constitutes a checking 
test for f . 

Proof. Let M be a relevance hypercube set of dimension I for /. By g denote an 
alternative read-once function that agrees with / on all vectors from M. Suppose 
that / and g satisfy all the assumptions made in Section [5] and let T be a tree of 
/. Choose a subset u of T's leaves according to the following (non-deterministic) 
recursive rules: 

1. Put u = u(«o), where vo is the root of T. 

2. If all leaves in T v have the same colour, then u(v) = {xi} for some leaf Xi 
contained in T v . 

3. Otherwise: 

(a) if v is labeled with a prime function h, then u(v) = (Ju(i>i) over all 
children v- t of v; 

(b) otherwise, if v is labeled with a symbol o g {A,V,©}, then u(v) = 
u'(v) U u"(v), where u'(v) = {Ju(vi) over all multi-coloured subtrees 
T Vi of v, and u"(v) = \Ju(vj) over some subset of all single-coloured 
subtrees T v . of v that contains one subtree of each colour. 

One can easily see that u is conservative and, by LemmaEJ stable. By Lemmas[3] 
andEl it contains exactly r leaves. It follows from Lemma Q] that g agrees with 
/ on some relevance hypercube H for u. Since all elements of u have different 
colours, it follows from Proposition [3] that the restriction of g to H is similar to 
g°. On the other hand, u contains at least one leaf from each root subtree of T, 
so the restriction of / to H has the form 

/' = A/l, ...,f's), 

where functions f[ depend on disjoint sets of variables from u. These two re- 
strictions are equal, so /' is a prime function, which is only possible if r = s and 
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g° is similar to f°. By Lemma IU sets of root subtrees' variables are the same 
for / and g. This means that 

5 (x) = /°(. 9 i(x 1 ),...,^(x s )). 

Since a relevance hypercube set for / contains relevance hypercube sets for all 
functions /^x 1 ), . . . , / s (x s ) (or their negations) regarded as projections of /, the 
whole argument can be repeated recursively. In the end, one sees that / and g 
can be expressed by the same formula, and so / = g. This concludes the proof. 

Corollary. For any read- once function f over Bi depending on n variables it 
holds that 

T Bl (f)<2 l -( n \=0(n l ). 



9 Discussion 

It is interesting to note that our result gives a stronger form of Kuznetsov's 
classic theorem on read-once representations [13]. The original result can be 
reformulated as follows: for any given Boolean function / and any two trees T\ 
and T2 representing /, there exists a one-to-one correspondence 4> between the 
sets of internal nodes of T\ and T 2 such that the functions represented by each 
pair of matching nodes are either equal or each other's negations. This fact was 
independently proved by Aaronson pQ, who also developed an 0(N l ° S2 3 log N) 
algorithm for transforming the truth table of / into such a tree. Note that a 
sequence of 0(iV)-sized circuits that check the existence of and output read- 
once representations over B\ for any fixed I was constructed in [15) . In these 
results N = 2™ is the input length. 

Now suppose Ti and T 2 are trees representing n- variable Boolean functions 
/1 and fi , and it is known a priori that these trees do not contain nodes labeled 
with prime functions of arity greater than I. Our technique reveals that in order 
to prove the existence of a correspondence </> it is sufficient to verify that f± and 
/2 agree on an 0(n')-sized set of input vectors. While Kuznetsov's theorem does 
not concern itself with computational issues, our theorem shows that only a small 
fraction of input vectors (in fact, a polynomial number of them, as compared to 
the total of 2 n ) is needed to certify the "similarity" of the trees. 
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